NODAL LENGTH FLUCTUATIONS FOR ARITHMETIC 

RANDOM WAVES 



MANJUNATH KRISHNAPUR, PAR KURLBERG, AND IGOR WIGMAN 

Abstract. Using the spectral multiplicities of the standard torus, we 
endow the Laplace eigenspaces with Gaussian probability measures. 
This induces a notion of random Gaussian Laplace eigenfunctions on 
the torus ("arithmetic random waves"). We study the distribution of 
the nodal length of random eigenfunctions for large eigenvalues, and our 
primary result is that the asymptotics for the variance is non-umversal, 
and is intimately related to the arithmetic of lattice points lying on a 
circle with radius corresponding to the energy. 



1. Introduction 

The purpose of this paper is to investigate the variance of the fluctations 
of nodal lengths of random Laplace eigenfunctions on the standard 2-torus 
T := M^/Z^. The nodal set of a function / is simply the zero set of /, and 
if / : T — )• R is a Laplace eigenfunction, i.e., if / is non-constant and 

(1) Af + Ef = 0, E>0, 

then the nodal set of / consists of a union of smooth curves outside a finite 
set of singular points (see [12]) and hence length(/~^(0)), the nodal length 
of f, is well defined. 

A fundamental conjecture by Yau \27\ [28] asserts that for any smooth 
compact Riemannian manifold M, there exist constants C2(M) > ci(M) > 
such that 

(2) ci(M) • Ve < Yol{f-\0)) < C2(M) • Ve 

for any Laplace eigenfunction f on M with eigenvalue E. By the work of 
Donnelly & Fefferman [15] and Briining & Gromes [3 [10], Yau's conjecture is 
known to be true for manifolds with real analytic metrics, and in particular 
for M = T. 

For T = MV^^ the sequence of eigenvalues, or energy levels, are related 
to integers expressible as a sum of two integer squares; if we define S := {n : 
n = + b'^, a,b ^ Z}, the eigenvalues are all of the form 

(3) En := 47r2n, n e S. 
For n & S, let 

A„ := {A G ^2 : = n} 
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denote the corresponding frequency set. Using the standard notation e{z) := 
exp(27riz), the C-eigenspace £n corresponding to En is spanned by the L^- 
orthonormal set of functions {e ((A, x))}^^^J^ . The dimension of denoted 

by 

Mn ■■= dimSn = T2(n) = |A„| 
is equal to the number r2{n) of different ways n may be expressed as a sum 
of two squares. 

1.1. Arithmetic random waves. The set can be identified with the 
set of lattice points lying on a circle with radius y/n, and its properties are 
intimately related to representations of integers by the quadratic form + 
y^. The frequency set is thus of arithmetic nature. A particular consequence 
is that the sequence of spectral multiplicities {Mn}n>i is unbounded. It is 
thus natural to consider properties of "generic" , or "random" , eigenfunctions 
fn £ £n, and our primary interest is the high energy asymptotics of the 
distribution of their nodal length C{fn) as n tends to infinity in such a way 
that Mn oo. More precisely, let /„ : T — M be the random Gaussian field 
of (real valued) functions with eigenvalue En, i.e., 

(4) fn{x) = Yl ' 

where ax = b\ + ic\ are independent standard complex Gaussian random 
variables, save for the relations a_x = ax. This just means that bx,cx ~ 
^"(0, 1) are standard real Gaussians satisfying the relation b^x = bx, c^x = 
—cx and otherwise independent. Our object of study is the random variable 

J^n :=£(/„) =length(/-i(0)), 

henceforth called the nodal length of /„. 



1.2. Prior work on this model. In this setting, Rudnick and Wigman 
computed the expected nodal length of /„ to be E[i2„,] = • \/E^, in 
agreement with Yau's conjecture ([2]). They also gave the bound 

for the variance, and conjectured that the stronger bound 



(5) Var(/:„) = O 



(6) 



Var(.„)^0(^) 



holds. A nice consequence of ([5]) is that C{fn) concentrates around its mean. 
More precisely, there is a sequence 5„, — t- such that 



P (^(1 - Sn)^ < C{fn) < (1 + ^n)^j ^1 aS A; ^ OO. 

In this paper we shall determine the leading order asymptotic of Var(£„) 
as Mn — >• oo. As consequence we improve on the conjectured bound ([6|) and 
obtain the sharp bounds 

^«Va,.,A.)«^. 
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It turns out that the asymptotic behaviour of the variance is non-universal 
in the sense that it depends on the angular distribution of the points in the 
frequency set A„. In the proof, a leading order sum involving many terms 
of size En/Mn surprisingly cancels perfectly, and the variance is therefore 
much smaller than expected! We may say that T exhibits "arithmetic Berry 
cancellation" (cf. Section ri.6.2p . 

1.3. Our results. In order to describe our results we shall need some fur- 
ther notation. The set A„ induces a discrete probability measure on the 
circle 5^ = E C : |z| = 1} by defining 

where 5x is the Dirac delta measure supported at x. The Fourier transform 
of //„, is, for any /c G Z, as usual given by Jlnik) ■= f^i z~^dfin{z). For n & S, 
we define 

(8) c„ 1±^; 

\ J " 512 ' 

it is then easy to see that Cn is real and that c„ G [1/512,1/256]. (Since 
An is invariant under the transformations z ^ 'z and z — )• i ■ z, the same 
holds for Hn, hence /in (4) S M. Further, since is a probability measure, 
|/in(4)| < 1, and consequently /in(4)^ G [0, 1].) 
We can now formulate our principal result. 

Theorem 1.1. If {ni)i>i is any sequence of elements in S such that Mm 
oo, then 

(9) Var(£„J = c„, -^(1 + 0(1)). 

Further, given any c € [1/512,1/256], there exists a sequence (nj)i>i of 
elements in S such that as i ^ oo, we have Mm — >• oo together with Cm c 
so that 

Var(£„J = c.^(l + o(l)). 

Til 

1.4. Attainable measures. The second part of the theorem, in light of 
the first one, amounts to the following: given any a G [0, 1], there exists a 
sequence (nj)i>i such that /I„^(4)^ — )• a. We briefly describe the measures 
giving rise to the extremal points as well intermediate values attainable by 
Cn (cf. Section [7] for full details, in particular the precise notions of generic 
and thin used below.) 

It is well known that the lattice points A„ are equidistributed on along 
generic subsequences of energy levels, see e.g. [TB], Proposition 6. Thus, for 
{ni)i>i a generic sequence of elements in 5 the variance is minimal in the 
limit since fimi^) 0, and thus Cm 1/512. It is also worthwhile mention- 
ing that the nodal length variance of fn for such generic sequences differs by 
an order of magnitude from the corresponding quantity for superposition of 
random planar waves with same wavelength and directions chosen uniformly 
on the unit circle (especially noteworthy in light of the fact that both have 
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the same scaling properties. See the last paragraph of Section 11.6.11 for a 
more detailed explanation). 

As for the maximum, Cilleruelo [13] has shown that there are thin se- 
quences (nj)j>i, with Mm — )• oo, such that converges weakly to an 
atomic probability measure supported at the 4 symmetric points ±1, ±i; 
hence Jlmi^) 1 and c„- — )• 1/256. For the intermediate values we con- 
struct thin sequences (ri.t)j>i of elements in S such that converges weakly 
to the uniform probability measure supported on a union of four arcs. 

More precisely, for a G [0, ^] define a probability measure Ua on by 

3 



(10) lE-^^ 



4 

k=0 



where ★ stands for convolutions of measures, and Da is the uniform measure 
on [—a, a] (identifying = M/27rZ). More exphcitly, 



fc . 



For a = 0, we shall use the notational convention that i^o = j ^^=0 

Proposition 1.2. For every a E [0, j] there exists a sequence of energy 
levels, such that ;U„. =^ Va with Ua as in (|iup . In particular, for every 
b G [0, 1], there exists a sequence £^„. of energy levels such that c^. — t- -g^. 

Note that the second statement follows from the first because the values 
of z?a(4) ranges over the whole of [0, 1] as a ranges over [0, j] (in fact, an 

easy computation shows that z?a(4) = '^'"^^"^ .) Further, the extremal values 
6 = and 6 = 1 are attained by = i^^, the uniform measure on 5^, and 
ly = i/Q, the atomic symmetrized measure. The proof of Proposition 11.21 will 
be given in Section [71 

1.5. Independence of eigenbasis choice and the covariance function. 

The random field (HI) is centered, Gaussian and stationary in the sense that 
for any xi, . . . ,Xk S T and y GT, the random vector 

(/n(xi+y),...,/„(xfc + y))G]R'= 

is a mean zero multivariate Gaussian, whose distribution does not depend on 
y. The covariance functiorQ '"(x) = r„(x) := E[/„(y)/„(x -|- y)] thus depends 
only on x, and we may express it explicitly as 

(11) r^ix) = 5^ e ((A, x)) = ^Yl (2^^^' ^)) ' 

■^^^ AeA -^^^ AeA 

Though the normalizing factor in the definition Q of /„ has no bearing 
on the nodal length, it is convenient to work with, and we have chosen to 
have r„(0) = 1, or, equivalently, for every x £ T, E[/„(x)^] = 1. 



-'^The covariance function is widely referred to as the 2-point function in the physics 
Uterature. 
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The covariance function determines the distribution of a centered Gauss- 
ian random field, and, in principle, one may express any aspect of the geom- 
etry of fn in terms of r„ only (cf. Kolmogorov's Theorem, [H] Chapter 3.3). 
This important fact also shows that we would get the same random field in 
^ had we chosen a different orthonormal basis of £n in the Gaussian linear 
combination. 

1.6. Background and results in related models. The question of dis- 
tribution of various local quantities such as the nodal length, or the total 
curvature of nodal lines in different settings, has been extensively studied. 
It is widely believed [2] that for generic chaotic billiards, one can model the 
nodal lines for eigenfunctions of eigenvalue of order w E with nodal lines 
of planar monochromatic random waves of wavenumber \/^ (this is called 
Berry's Random Wave Model or RWM, see ()12p for the definition). Hence 
the importance of the (random) nodal length and other properties of the 
RWM. Berry [3] found that the expected nodal length (per unit area) for 
the RWM is of size approximately ^/E, and argued that the variance should 
be of order log E. 

The 2-dimensional unit sphere 5^ is another manifold with degenerate 
Laplace spectrum. Here the Laplace eigenvalues are all the numbers E = 
m{m + 1) with m > an integer, and the corresponding eigenspace is the 
space of degree m spherical harmonics; its dimension is 2m -|- 1. One may 
define the random field of degree m spherical harmonics similarly to the torus 
with the plane waves (exponentials) replaced by any L^-orthonormal 
basis {rjm-i, ■ ■ ■ '?m;2m+i} of real valued spherical harmonics of degree m: 



with Ofc i.i.d standard Gaussian. Setting C yf^ j to be the nodal length of 

, Berard [T] computed the expected nodal length 



Wigman [30] found that the nodal length variance is asymptotic to 



which is consistent with Berry's prediction for the RWM. 

1.6.1. Comparing the random wave model to the torus and the sphere. The 
logarithmic variance is much smaller than one would expect: taking into 
account that the wavelength for either the sphere or the RWM scales as , 
one may rescale them to unit wavelength to argue that the nodal length 
variance should be proportional to ^/E. However, a computation reveals that 
the coefficient in front of the expected leading term y/E surprisingly vanishes 
due to "Berry's Cancellation Phenomenon" — the leading term for nodal 
length variance is in fact logarithmic. A similar cancellation phenomenon is 
responsible for the variance ^ in our situation being of order of magnitude 
rather than of order jf^ as was originally conjectured in |24j . 




2m+l 
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As already remarked, in general, defining a centered (or mean zero) Gauss- 
ian random field / on an arbitrary domain T is equivalent to specifying its 
covariance function rf{x, y) := E[/(x)/(y)] on TxT. For the planar random 
waves (RWM) the covariance function is 

(12) rYmM{x,y) = Joiy/E\\x -y\\) 

with Jo the standard Bessel function, and 

rs^{x,y) = Pm{cosd{x,y)) 

for the deg are the usual Legendre 

polynomials, and d is the (spherical) distance. The latter scales as 

Pm{cos{ip / m)) ^ Jo{d), 

uniformly for -0 6 [O, m • ^] . As the corresponding eigenvalues are m(m-|-l), 
this is consistent to the RWM scaling. The covariance function r„(rc) for 
our ensemble /„ of random toral eigenfunctions given by (jH) is of arithmetic 
flavour: it is given by the summation (jlip over lattice points A„ lying on a 
circle. 

The equidistribution of A„ along generic sequences of energy levels on the 
torus mentioned earlier implies that for any fixed y € one may approxi- 
mate 

rniy/(2ir^)) ^ j cos{{y,z))dz = jQ{\\y\\) 

for a generic sequence {nj} C S. Although it is the same scaling limit as 
before, the latter holds for y of fixed size only, and by no means uniformly 
for y G [0, n]^. In particular, as opposed to the other cases, no "intermediate 
range asymptotic" for r„(x) is known, i.e. for x ■ ^Jn — )• oo. It is remarkable 
that even in this case, in spite of the fact that the covariance function for 
jn has the same scaling limit as the RWM and random spherical harmonics 
random fields, the nodal length variance ([9]) of random arithmetic waves is 
of different order of magnitude compared to the other cases. 

1.6.2. Berry's cancellation phenomenon and the 2-point correlation func- 
tion. In order to evaluate moments of the nodal length of a random field / 
we exploit a suitable Kac-Rice type formula (see Section [27T]) . For the vari- 
ance it means that we need to understand the fluctuations of the so called 
2-point correlation function (defined on the domain of /) around its scaled 
asymptotic value at infinity. For both M? (RWM) and 5^ the main con- 
tribution for the variance comes from the "intermediate range" (i.e. a few 
wavelengths away from the origin), where the asymptotic behaviour of the 
covariance function and its first two derivatives translates to asymptotics 
for the 2-point correlation function. No analogous asymptotics is known 
for the torus. The cancellation phenomenon amounts to the fact that the 
leading term in the intermediate range asymptotics for the 2-point corre- 
lation function is purely oscilliatory and its contribution to the integral is 
negligible. Then, the main contribution comes from the second term in the 
asymptotics. 

As a substitute for pointwise asymptotics for the 2-point correlation func- 
tion, we use an arithmetic formula (see (j33p ) valid outside a suitably defined 
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"singular set" (arithmetic in nature; its analogue for and sphere is a 
neighbourhood of the origin, with radius of order wavelength). Although the 
arithmetic formula does not give the pointwise behaviour of the 2-point cor- 
relation function, its arithmetic structure is exploited for averaging over the 
torus and is essential for evaluating the variance. The "arithmetic Berry's 
cancellation" amounts to the Fourier expansion of the highest magnitude 
term of the 2-point correlation function vanishing at the origin, an artifact 
of the seemingly unrelated trigonometric identity 

4 cos{e/2f = 1 + 2 cos + cos{ef 

(see Section [4.21 for more details). 

1.7. Some other related results. For a generic compact manifold with 
no spectral degeneracies, one can also consider random Gaussian linear com- 
binations of eigenfunctions with different eigenvalues (sometimes referred to 
as "random wave on A^"). Berard [T| and Zelditch [32] found that, given 
a spectral parameter the expected nodal length for random Gaussian 
superpositions of eigenfunctions with eigenvalues lying either about E or 
belo'VMJ E is of order ^/E, consistent with Berry's RWM. The subtle ques- 
tion of the nodal length variance in this generic setup is to be addressed 
in [23] . 

Some other generic results concerning random waves with spectral pa- 
rameter E: Toth and Wigman [26] found that the expected number of open 
nodal lines, i.e., the connected component of the zero set that intersect 
the boundary, of the random wave with spectral parameter E on & generic 
surface with boundary is again of order ^/E. Moreover, Nicolaescu [20] eval- 
uated the expected number of critical points to be of order E; the latter is 
also an upper bound for the number of nodal domains. 

For other related or relevant result we refer the interested reader to the 
recent survey [31]. Also, for recent very interesting results and conjectures on 
non-local quantities, such as nodal domains (i.e. the connected components 
of the complement M \ /"^(O) of the nodal line) see [1 [2ll [6l [3 [8] . 

1.8. Outline of the paper. The paper is organized as follows. The proof 
of Theorem 11.11 assuming certain preparatory results is given in Section [2} 
this proof relies on an arithmetic formula (|16p in Proposition 12.11 whose 
proof is commenced in Section [H The proof of the formula (jl6|) is based on 
studying the behaviour of the so-called 2-point correlation function intro- 
duced in Section [H] its subtle asymptotic analysis is given in sections [H [S] 
and [51 with the latter containing a certain technical computation essential 
for understanding the asymptotic properties. 

Section [6] is dedicated to the proof of Theorem 12.21 an arithmetic bound, 
due to Bourgain, needed for the admissability of the error term in (jl6p . In 
Section [7] sequences of energy levels with corresponding discrete probability 
measures ([7]) converging to the measures Ua as in ([10]) are constructed to 
prove the attainability of the latter. 



Called the short or long energy window random combinations respectively. 
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2. Proof of Theorem 11.11 

2.1. Kac-Rice formulas. Moments of the nodal length for smooth random 
fields can be computed using the Kac-Rice formulas |14j . To state them, we 
need some notation. For f = fn, we define its first and second correlations 
as follows: 



Ki=K 



l|V/(y)|| /(?/) = 



(13) i^2(x)=E[||V/(y)||-||V/(y + x)|| f{y) = f{y + x) = 

K2{x) = J-K2(x). 

Observe that K\ and K2 are independent of y because /„ is stationary (for 
general smooth Gaussian fields, they become Ki{y) and K2{x, y)). They are 
called the first and second correlations of the nodal set /^^^(O). K2 is just a 
scaled version of the second correlation. As we are dealing with Gaussians, 
it is possible to write analytical expressions for these as Gaussian integrals 
in terms of r„ and its derivatives (see ([2^ . and ([MI))- 
Then, the Kac-Rice formulas say that 

E[Cn] = J Kidy = Ki, E[Cl] = J K2{x)dx. 

T 

The first of these formulas gives E[£„] = • \/En as was quoted earlier. 
Using this and the second gives 



Var(£„) = ^K2{x) - \ 



(14) Var(£„) = ^ J { ^2(x) - - ] dx. 

T 

Full justification of their validity in our context may be found in [23]. For 
the Kac-Rice formulas in general, consult |14j . 

It is instructive to intuitively understand the function K2{x) in the fol- 
lowing way. Let x £ T, and take a small positive number e > 0. We define 
the random variables 

£^'^ = length {f~\0)nB{x,e)), 

where B{x, e) is the disk of radius e centered at x; Cn'' measures the nodal 
length of /„ inside the corresponding disk. Then we have 
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2.2. Computing the variance - the Gaussian integral. To understand 
Var(/: n) we need to understand the integral in ()14p . The function K2 may be 
implicitly expressed in terms of the covariance function r„ of fn as a Gauss- 
ian expectation of a 4-variate centered Gaussian (V/„(0), V/n(x)) condi- 
tioned on /n(0) = fnix) = 0, with 4x4 covariance matrix Qnix) depending 
on r and its derivatives (see ()29p . (|25p and (|24p ). Li our case the covariance 
function is the arithmetic function ()lip . 

In order to study the asymptotic behaviour of the integral above, we use 
some ideas from [22l |23] and divide the torus into a singular set B and the 
nonsingular complement T\B; only the latter is convenient to work with, so 
it is essential make the former as small as possible. We improve the analysis 
of the earlier paper [25j on both B and T\B. 

A better upper bound for the measure of B is proved using the 6th mo- 
ment of r rather than the 4th one. As an artifact of the definition of B, 
one has a lower bound for the values \r{x)\ on B; using a Chebyshev-like 
inequality on the 6-th moment of r{x), we will bound the measure of so 
that its contribution to the variance is negligible. 

On T\B, where the main contribution comes from, we establish a precise 
asymptotic expression for the 2-point correlation function compared to a 
partial upper bound as in [24j (see Proposition 14. 5p . Here the (scaled) 
covariance matrix 0„(x), defined by ([2^ . is a perturbation of the identity 
matrix. In order to understand its contribution to the integral, we expand 
K2{x) as a function of r2„ into a 4-variate Taylor polynomial around = -^4; 
the identity matrix. In principle, this could be performed using brute force; 
we choose to work with Berry's elegant method [3]. 

The computation above culminates in the "arithmetic formula" given in 
the following proposition. It is of arithmetic essence and at the heart of the 
variance being non-universal, but the derivation itself involves no arithmetic. 
Before stating the proposition, we define 

(15) 7^fc(n) := J |r„(x)|^^(ix. 

T 

Proposition 2.1. The nodal length variance is given by the asymptotic 
formula 

(16) Var(£„) = c„•^ + 0(E„•7^5(n)), 

where we used the notation ([5]) for Cn- 

The reason we refer to p6p as "arithmetic" is that both the main term 
and the error term in l\l<j\] are of arithmetic nature: Cn is related to the 
distribution of lattice points A„ on the circle (see Section [5|), and TZ^ is 
controlled in terms of arithmetics of spectrum correlation. 

The proof of Proposition 12.11 will commence in Section HI It is lengthy 
and quite technical, so it may be omitted on a first reading of the paper. 

In case of random spherical harmonics or the random wave model, one 
arrives at analogous propositions for the variance. The proof essentially ends 
there as the error term may be checked to be smaller that the main term. 
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2.3. Computing the variance - the arithmetic part. In our setting 
however, the main obstacle is in proving the admissability of the error term. 
We will control various error terms in terms of the moments TZk{n) := 
J^\rn{x)\''dx (cf. (USD.) The even moments are naturally related to the 
spectral correlations; for example, it is straightforward to check that 

(17) 7^6(n) = -^|56(n)|, 
where Sq is the 6-correlation set of frequencies 

(18) Sein) 



|(Ai,...A6) G A„ : J^Ai =o| . 



Since for any choice of Ai, . . . , A4 G A„ there are at most 4 possible choices 
for A5, Ae G A„, it follows that |S'6(n)| = O (A/"^) , or, equivalently 

(19) %(„)=0(^). 

The latter bound is not quite sufficient for our purposes, but the following 
result, due to J. Bourgain is sufficiently strong for our purposes. 

Theorem 2.2. As Nn 00, we have the following estimate: 

(20) \Se{n)\=o{M^). 

Consequently, TZQ{n) = 

Theorem 12.21 will be proven in Section [6l 

We give a brief preview of the proof of Theorem 12.21 To refute the possi- 
bility that \S6{n)\ > 7V^, we invoke some techniques from additive combi- 
natorics (see Section [6]). Utilizing a notion of "additive energy" defined in 
Section [6l a certain set A related to the sum set of A„ is shown to contain 
a large subset Ai with "bounded doubling". Using a suitable version of 
Freiman's Theorem, this implies that Ai is essentially a generalized arith- 
metic progression (GAP - see Theorem (63]), i.e., is contained inside a slightly 
larger GAP. This then leads to a contradiction via an application of Chang's 
result |11] on the number of representations of a complex number as a prod- 
uct of elements inside a GAP. 

We note that the bound |5'6(n)| = o{M^) is quite far from the truth; 
Bombieri and Bourgain have recently obtained an exponent savings. We 
further remark that a trivial lower bound is that |5'6(n)| A/"^; taking 
any Ai,A2,A3 G A„ and letting A4 = — Ai,A5 = — A2, and Ag = —A3 yields 
|An|^ = solutions of "diagonal type". We believe that essentially all 
solutions arise like this, and conjecture that for every e > 0, 

|56(n)| = 0,(AA3+^). 
(Possibly the even stronger bound |5'6(n)| = O (A/"^) holds.) 



NODAL LENGTH FLUCTUATIONS 



11 



2.4. Proof of Theorem 11.11 assuming the preparatory results. Given 
Proposition [L2l Proposition 12. 11 and Theorem l2.21 it is now straightforward 
to deduce Theorem 11.11 our main result. Recah that TZk are the moments 
([15]) of r„. Using the Cauchy-Schwarz inequahty on |r-(x)|^ = r^(x) • |7'(a;)p 
together with the bound 7^4 = O {j^) (which follows from the same argu- 
ment that yielded p^ ). and the bound 7^6 (^) = o (^j^^ from Theorem [ 
we obtain 

(21) 7^5(n) = 0(^-^2 

Now, using (fT6]) together with ([2T]) we obtain ([9|). 

Finally, using the second part of Proposition 11.21 and the definition of Cji 
(see ([7|) we find that any c € [1/512, 1/256] is attainable as a limit. 

3. The 2-point correlation function of /„ 

In this section we use the Kac-Rice formula ()14|) that expresses Var(£„) 
as an integral of the (scaled) 2-point correlation K2 defined in ()13p . For 
this we will need to study some aspects of the random field fn first. From 
this point on we fix n and will usually suppress the n-dependency with no 
further note. 

3.1. Joint distribution of values and gradients. In order to study the 
variance, we shall need to study the random vector 

W = Wn;. = {uuU2,Vi,V2) = {fn{0)Jn{x),Vfn{0),yfn{x))eR^ 

Since is a linear transformation of the standard Gaussian a = (ax) G 
M/^", its distribution is also centered (or mean zero) Gaussian, and by the 
stationarity, and x may be replaced by any y and y + x. 
Let 

(22) D = Z?ix2(x) = Vr(x) = '^^e ((A, x)) ■ A 

^^"^ ASA 

(cf. (jlip ). The vector W is centered Gaussian with covariance matrix 
(cf [21], Section 5.1) 

'A 

B* C, 



where 

A{x) 
B{x) = 

and 

C{x) = 



1 r{x) 
r{x) 1 

D{x) 
-D{x) 



f/2 -H{x) 

E 
2 



-H{x) f/2 



where H2x2{x) is the Hessian 

\ A„2 
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by (note that A is a row vector so that A* A is a 2 x 2 matrix). 

The covariance matrix of (V/(0), V/(x)), conditioned on /(O) = f{x) = 

is 

where we write r = r{x) for brevity. Thus, K2{x) = E[||Vi || • ||V2||] where Vi 
are 2-dimensional random vectors with (Vi, V2) having Gaussian distribution 
with zero mean and covariance matrix 0,{x). 

3.2. The scaled 2-point correlation function. It is more convenient to 
work with the scaled covariance matrix 

(25) n{x) = rinix) = ^^n{x). 

Then, the scaled 2-point correlation function defined in (|13p may be written 
as 

where Vi, V2 are centered Gaussians with covariance matrix Q{x). 

At the origin x = the matrix i}{x) is singular and hence corresponds 
to a covariance matrix of a degenerate Gaussian. However for almost all 
X G T, i}{x) is nonsingular, see [24^ Proposition A.l]. We claim that 0,{x) is 
a small perturbation of the 4x4 identity matrix I4, at least, for "generic" 
X. To quantify the latter statement, write 



(26) n{x) = I + 

where 



X Y 
Y X 



(27) X = Y = -—(H + —-jD^D 



2 

and both X = Xn{x) and Y = Yn{x) are small for "typical" x. 

With these computations, we may rewrite the Kac-Rice formula (jl4|) as 
follows. 

Proposition 3.1 (Cf. [21], Proposition 5.2). The nodal length variance is 
given by 

En n , . 1 



(28) Var(£0 = ^ j [K^ix) - -j dx, 

T 

where K2 is the scaled 2-point correlation function given by 

(29) K2{x) = K2;n{x) = —^=L===n\Vi\\ ■ \\V2\\\, 

here Vi , V2 G are centered Gaussians with covariance matrix given by 
(1261) , with X and Y as in ^1 



We shall need the following lemma later. 
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Lemma 3.2. The matrices Xn and Yn are uniformly bounded (entry-wise), 
i.e. 

(30) Xn{x),Yn{x) = 0{1), 

where the constant involved in the 'O' -notation is universal. In particular, 

(31) K2;nix) <. ^ 



V^l - r„(x)2 

Proof. To prove that (j3U|) holds it is sufficient to show that the diagonal 
entries of X are uniformly bounded, by (|26|) (the non-diagonal entries of 
a covariance matrix are dominated by the diagonal ones, by the Cauchy- 
Schwarz inequality). For the latter, it is sufficient to notice that the diagonal 
entries of 17 are positive, and the diagonal entries of X are < (recall (j27|) ). 

To prove that the bound in (j3ip holds, we use (|29p . the Cauchy-Schwarz 
inequality and ([30]). ■ 



4. Proof of Proposition 12.11 



To find the asymptotics of the integral (j28]l . we will study the pointwise 
asymptotic behaviour of K2. Even though we will only be able to determine 
the precise asymptotics outside the so-called singular set, already used in [22] 
and [23], we will prove that the exceptional singular set is small, so that its 
contribution is negligible (see Lemma l4.4p . To quantify the last statement, 
we will control the contribution using a Chebyshev-like inequality, so that 
the corresponding error term will naturally involve the moments (jlSp of the 
covariance function (jlip . We improve upon the analysis of [22] by working 
with the 6th moment TZQ{n) rather than TZ^^n). 

4.1. The singular set. For r(x) bounded away from 1 we may expand the 
^ factor in (|29p and related expressions into the Taylor series around r = 

0. Since the moments of r are "small" (by Theorem I2.2| say), a Chebyshev- 
like inequality, implies that r(x) is small outside a small set. This is the 
main idea behind the notion of the singular set to follow. We use a slightly 
stronger definition in order to endow the exceptional set with a structure as 
a union of squares, necessary in order to find its contribution to the integral 
()28p . The following definitions are borrowed directly from [22], Section 6.1. 



Definition 4.1. A point x S T is a positive singular point if there is a set 
of frequencies A^, C A with density 

lAJ 7 
> - 



|A| 

for which cos(27r(A, x)) > 3/4 for all A G A^.. Similarly we define a negative 
singular point to be a point x where there is a set A^; C A of density > | 
for which cos(27r(A, x)) < —3/4 for all A G A^;. 

Let M « y/En be a large integer. We decompose the torus T as a union 
of M'^ closed squares of side length 1/M centered at k/M, k G Z^. The 
squares have disjoint interiors. 

Definition 4.2. A square is a positive (resp. negative) singular square 
if it contains a positive (resp. negative) singular point. 
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Definition 4.3. The singular set B = Bn is the union of all singular squares. 

Note that, by the definition, each singular square contains a singular 
point; however, points in B are not necessarily all singular. Let y £ B he 
a point lying in a positive singular cube, x be the corresponding positive 
singular point lying in the same singular cube and A^,. C A the frequency 
set prescribed by the definition of a positive singular point. It is easy to see 
that 

|cos(27r(A,y)) - cos(27r(A, x))| < 

where the implied constant is absolute, so that one may choose M ~ V^^n 
for which the latter expression is < |; it will then imply 

cos(27r(A,y)) > ^ 

for every A G A^;. We then conclude that 

r{y) = AtY1 cos(27r(A,2/)) + J] cos(27r(A, y)) 



. ^ ^ . 7 15 



1-^1 1 

|A| ^ 2 ~ TaI ^ ^ " 16 8 16 

' ' AeA^ ' ' AeA\A^ 



and, similarly, if y is lying in a negative square then r(y) < — Hence we 
have \r{y)\ > ^ on all of B. We then write 



T^&in) > meas(i?) 



16 



to obtain the Chebyshev-type inequality 

(32) meas(B) < 7^6(n). 

It was proven ([22], Section 6.5) that if S is any singular square then its 



contribution to the integral ()28p is 

« I \K,ix)\dx « 
s 

Since the number of the singular cubes is 

< meas(S), 
the total contribution of B to (j28p is bounded by 

/ \K2{x)\dx < meas(S) 3_ = meas{B)^£= < TZe{n) 

J My En V En 

B 

by (|32p and Ad ~ \fEn- The latter is summarized in the following lemma: 

Lemma 4.4 (Cf. [22], Section 6.3). The contribution of the singular set to 
()28p is bounded by 

' \K2ix)\dx = O {TZein)) . 
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Lemma 14.41 bounds the contribution of the singular set to the integral 
in (j28p . The main contribution comes from the nonsingular set; in order to 
evaluate it we will need a precise point-wise estimate for K2{x) in this range; 
this is given by the following proposition, up to admissible error terms. (To 
verify the admissibility, see Lemmas 14.61 and 15. 4p . 

Proposition 4.5 ("Intermediate range" asymptotics for i^2)- For x € T\B 

we have 



(33) 



K2{x) = j+L2{x) + e{x) 



where the main term L2{x) is given by 
(34) 



, , 1 /r2 trX tr(y2 3 

L2(x) = \ \ + -r 

2\ J 4V2 2 8 



4 tr(Xy2) tr(X^ 



16 



+ 



tr(y4) tr(y^)^ trXtr(y^) 1 2 



256 



+ 



32 
1 



512 



32 4 16 ^ ^ 



with X = Xn{x), Y = Yn{x) and r = r„(x), and the error term e{x) is 
bounded by 



(35) 



|e(x)| = O {r{xf + tv{X^) + tr{Y^)) 



Proposition 14.51 will be proved in Section [SJ Assuming it, we arrive at the 
proof of Proposition 12.11 

Proof of Proposition \2.1\ assuming Proposition \4-5\ We invoke Proposition 
l3.1l to express the nodal length variance. Since the contribution of K2{x) to 
the integral (j28|) on B is 

0{En-TZ(,{n)) 

by Lemma 14.41 we have 
(36) 



Var(£„) = |i 



En 

2 



j {K2{x)--^dx 

T\B 

'f ' ' 

/ L2{x)dx + 



T\B 



En • 



\e{x)\dx 



+ 0(En•7^6(n)), 



by Proposition 14.51 Note that 

\e{x)\dx< / \e{x)\dx = O {En ■ TZein)) 
Jt 

by ^^ij and Lemma 14.61 to follow (see parts fTUlfTT]) , so that 
(37) Var(£„) = ^ J L2{x)dx + O {En ■ Uein)) . 

T\B 



IS 
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We may further note that, smce L2{x) is uniformly bounded thanks to 
Lemma 13.21 

^ j L2{x)dx = 0{En- meas(S)) = O (E„ • 7^6(?^)) , 

B 

SO that we may rewrite (|37p as 

(38) Var(£„) = ^ j L2{x)dx + O {E^ ■ n^in)) , 

T 

the upshot being that we are now able to use the definition ()34p of L2 and 
integrate the RHS of ()34p term by term, as in Lemma [4.6l (where the domain 
of integration is the whole of torus T rather than T\B). We then perform 
the term-wise integration of (|34p to obtain (with Lemma l4.6p 

4 • / L2(x)dx = -V(----2 + -- 4 

T 

+ ^-4(7 + /2„(4)2) + 1.8-^-2 + ^.8) + O {n^in}) 

1 l + /in(4)^ , 

= ^-^^ + 0(^5(n)). 

Collecting all the constants encountered and bearing in mind (|38p yields 
(jl6p . which is the statement of the present proposition. ■ 

4.2. Some remarks on arithmetic Berry cancellation. While the con- 
stant term | cancels out with the expectation squared, the leading noncon- 
stant term of the scaled 2-point correlation function (i.e. the leading term 
o{K2{x)-l) is 

1 / 9 tr(y2)\ 1 / „ 2 ,1 

-(r^ + trX + ^^l ^ - ir^ - —DD' + ^trW 



n 



where we neglected some lower-order terms. Denote the expression in paren- 
thesis 

(39) vix):=r^-^DD^ + ^trH^. 

We may substitute pT]) . (|22]) and (|23l) into (|39]) to rewrite v{x) as 
^^^)=]^ Yl e((Ai + A2,.x)) + -^ Yl £;^/4^2 ^((^i + ^2,x)) 



AA2 



Ai,A2eA„ 

on recalling dSj). 



E fl + 2^ + ^^)e((Ai + A2,x)), 
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Note that 

— = cos0(Ai, A2), 
n 

where 6{-,-) is the angle between two vectors in M?. Thus we may write, up 
to lower order terms, 

^(^) = ;^ Yl (1 + 2cos0(Ai,A2) + cos(0(Ai,A2))^) e((Ai + A2,a;)) 

Ai,A2eA„ 

Ai,A2GA„ ^ ^ 

by the usual trigonometric identities. Upon integrating (|28p . all the sum- 
mands vanish except for Ai + A2 = 0; the corresponding angle 9 is given 

by 

= 0(Ai,A2) = 7r, 

so that cos(0/2) = 0. Thus the arithmetic cancellation phenomenon in the 
length variance amounts to cos(0/2)^ vanishing at = vr. 

4.3. Integrating matrix elements. We may obtain an asymptotic ex- 
pression for the nodal length variance upon using ()28p with Proposition 14. 51 
provided that we are able to integrate the expressions on the RHS of (j33p . 
term-wise. This is done in Lemma 14.61 to follow immediately. We choose 
to control the various error terms encountered in terms of the moments of 
r, TZk (recall the notation (jl5p ). It will turn out that we will be able to 
control the error terms in terms of TZ^ (and TZq < TZ^), admissible thanks 
to Theorem 12.21 via a simple Cauchy-Schwarz argument (see the proof of 
Theorem II. II in Section [2.4p . The proof of Lemma 14.61 is left to Section [5.11 

Lemma 4.6. As Nn 00 we have the following estimates. 

1. JtTX{x)dx = -^-j^ + 0{ne{n)). 

T 

2. ftr{Y{xr)dx = j^-^ + 0{n,{n)). 

T 

3. /tr(X(x)y(x)2)dx = -^ + 0(7^5(n)) 

T 

4. ftT{X{xf)dx = j^ + 0{nG{n)) 

T 

5. / tv{Y{x)^)dx = -^(11 + /2„(4)2) + O (7^6(n)) . 

T 

6. / tv{Y{x)^)^dx = A{3 + /2„(4)2) + O (7^6(n)) 

T 

7. ftrX{x)tr{Y{x)^)dx = - 4^ + O (Tie (n) ) 

T 

8. fr{x)^tTX{x)dx = -j^ + 0{TZ6{n)) 

T 

9. / r(x)2 tv{Y{x)^)dx = j^ + {TZ^{n)) . 

T 

10. / iT{X{xf)dx = 0{n^{n)). 

T 
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11. / tr{Y{xf)dx = O (7^6(n)) . 

T 



5. ASYMPTOTICS FOR THE 2-POINT CORRELATION FUNCTION 

The ultimate goal of this section is to prove Proposition 14.51 To estabhsh 
the desired asymptotics for ()29p . one needs to understand the behaviour 
of E[||Fl|| • IIV2II] where (Vi, V2) is a centered Gaussian with covariance Qn, 
the latter being a small perturbation of the indentity matrix, given by (j26p . 
where both X and y are small. That is, we expand y) = E[||Vi || • IIV2II] 
into a Taylor polynomial of the entries of X, Y, about X = Y = 0. 

The degree of the required Taylor polynomial in each of the variables is 
determined according to its (average) order of magnitude and the admissible 
error term. In principle, one may compute the polynomial by brute force, 
computing each derivative separately, but this approach results in a long and 
tedious computation. In this manuscript we employ Berry's method [3] in or- 
der to compute the nodal length fluctuations for the random monochromatic 
planar waves. The following lemma provides the Taylor approximation of 
F{X, Y) for perturbed standard Gaussian. 

Lemma 5.1. Let A G M^iR) be a positive definite matrix such that 



/ + 



X Y 
Y X 



where X,Y G M2(M) are symmetric, rank{X) = 1. Define 

F(X,y)=E[||l^i||-||T^2||], 
where {Wi,W2) £M? xE? is centered Gaussian with covariance A. Then 

Fix, Y) = -(l + ^-^ + ' 



2 V 2 8 16 32 256 

tr(y2)2 trXtr(y2)\ ^, ^^^^ 
+ + ^ + • 

Proof of Proposition \4-5\ assuming Lemma \5.1[ Note that since D^D is a rank 
1 matrix, it satisfies 



ii{D*D) = DD\ 

A straightforward application of Lemma 15.11 with X and Y given by ([2 
yields (for a; G T \ r is bounded away from ±1, so we may write 
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l + ir2 + |r4 + 0(r6)) 



+ 



+ 



1 


2ttVI - 


- r 


1 




4Vl - 




tr(y4) 


+ 


256 




y 


tr(y4) 


+ 


256 



• F(x,y) 

^ _^ trX _^ tr(y2) tr(Xy2) tr(X2) 



16 



32 



-2\2 



512 



2 



+ 



32 

3 



^ + O (tr(X3) + tr(y*^)) 



trX tr(y2) 3 4 tT{XY^) tr(X^ 



8 



512 32 
+ 0(r*^ + tr(X3)+tr(y6)) . 



16 32 

. .tX+—i 
4 16 



+ -r^trX ^ r^tr(y 



To present the proof of Lemma l5. II we need some notation. 

Notation 5.2. For a matrix A and a number a we write A = 0(a) if the 
corresponding inequality holds entry-wise. 

Notation 5.3. For t G we denote m{t) := min{t, 1}, and for t, s € R, 

m(t, s) := m{t) ■ m{s). 
Proof of Lemma \5.1\ Following Berry, see [31 Eq. (24)], 

oo 

1 f , _at dt 



we have 

(40) E[\\W,\\ ■ = ^ 11 [f{0,0) - f{t,0) - f{0,s) + f{t,s)]^^, 



a 



where 
(41) 

/^'^(t,s) = /(x,y) :=E 



exp (^-^{\\Wi\\' + \\W2f)^ 



det(/ + M) ' 



with 



/Vt/ ]^(Vii 
I V^i) [ ^ij- 



Now by the well-known formula for the determinant of a block matrix (see 
e.g. [H], p. 210), we have 

det(/+M) = det ((1 + t)I + tX)-det ((1 + s)I + sX - stY{{l + t)I + tX)-^Y) 

so that 
(42) 



det(/ + M)^^/2 ^ ^ ^ ^j^) 



-1/2 



det ((1 + s)/ + sX- stY{{l + t)/ + tXy^Y 



l^.^-V2 
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Now we compute each of the two factors of the RHS of (|42p . up to the 
admissible error terms and , as in the formulation of Lemma I5.lt 



(43) det(/ + = 1 _ 1 tr A + ^ ti{A^) + -(tr Af + 0{A^), 

2 4 8 



so that the first factor in the RHS of (jl2|) is 
(44) 

det ((1 + t)I + txy'/^ = ^ det [l + 

= • ( 1 — rtrX + — ^ — 7TTtr(X2) _^ t — j^^2 ^ Qrj^i 

l + t V 2(1 + t) 4(l + t)2 ^ ^ 8(l + i)2^ V ^ V 

To compute the second factor in the RHS of (jl^j) we write 

and we then have 
(45) 



det / + X - -Y I + X Y 



1 + s y i + s (i + s)(i + t) V i + * 

^ .detf/+-^X-- v>^' + 7; ?5 y^YXY + OiYX^Y) 



l + S V 1 + ^ (l + s)(l + t) (l + s)(l + t)2' 

^ ^--^trx + i- ^tr(y2)_i^ — ^tr(yxy) 



1 + sV 2 1 + s 2(l + s)(l+t) ' ' 2(l + s)(l+t)2 

+ 0(tr(X3) +tr(y6)) 



upon using (j43|) with 



2 



^ = X - 7 -y^ + rr^YXY + o(yx^y). 

l + S (l + s)(l+t) (l + s)(l+t)2 ^ ^ ^ 
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Cross multiplying (|44p and (j45p and substituting into (j42p and finally into 
[1]), we obtain an asymptotic expression for f^'^{t, s) of the form 

(46) 

f^X^t ^) = 1 ^ ( 1 _ tr X + r tr(y2)_ 

^'^ (l + t)(l + s)V 2 1 + s 2{l + s){l + t) ^ ' 

^ tr(yxy) + tr(x2) + y trfy^) 

2(l + s)2(l+t) ^ ^^8(1 + S)2^ ^ ^ 8(l + s)2(l + t)2 ^ > 



1 - - ( + ) trX+ -y^^^y^^tr(y^ 

tr(xy2) 



(l + t)(l + s)V 2VI + S l + tj 2(l + s)(l + t) 

1 / 5*2 sH 



2 V(l + s)(l+t)2 (l + s)2(l + t) 



3 X ts \ 1 s2^2 



+ (^8 (1 + s)2 + 8 (1 + t)2 + 4 (1 + 0(1 + s) J ^ ^ 4 (1 + s)2(l + t)2 ) 

+ 8(l + .)2(l + t)2*'^^ ^ - i 1^(1 + S)2(l + t) + (1 + s)(l + t)2 J ^ 

+ 0(tr(X3) +tr(y6)) 
where we used 

(47) tr(yxy) = tr(Xy2), tr(X2) = (trX)^ 

the latter thanks to rkX = 1. 

It is important to identify ()46p as the Taylor expansion of f"^'^{t,s) for 
fixed t, s, as a function of X, Y around X = Y = (i.e. a Taylor poly- 
nomial in terms of the entries of X and Y). In the next step we perform 
the integration in ()40p by integrating the various terms in (06]). The main 
problem is that the error term does not depend on t,s, so that its integral 
against ^3/2^3/2 is divergent at the origin. We improve the error term in the 
following way: define 

(48) 9^'^(t, s) := /(0, 0) - fit, 0) - /(O, s) + f{t, s), 



so that under the new notation (1401) is 



1 f f XY ( ■ dtds 



(49) E[\\W^\\ ■ \\W2\\] = - 9^''{t,s) 



2vryy ^ ' ' '(is)3/2- 



It is evident that for every X, Y the function g^'^ vanishes if either t = 
or s = 0, so that for t,s > 0, 

g^'^{t,s) = Ox,Yits). 
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We now substitute ([16|) into (P8|) in order to expand g-^'^ {t, s) into a Taylor 
polynomial around X = y = 0; by the latter observation the remainder 
term may be improved from O (tr{X^) + tr(y^)) to 

O {m{t, s){tr{X^) + ti{Y^))) 

(recall Notation I5.3p . To compute the contribution of each of the summands 
in (|46p. we notice that each of summand splits into a product (j){t)il:{s) for 
some (j) and "0 (that are read off directly, for example, for the constant term 
(i-l-tXi+s) ' '^^^•^ ~ TTt ^"^^ ''Pi^) — TTs-'' ®° ^^^^ corresponding term of 
g^^^{t,s) in dM]) is 

c|>{t)^l,{s) - mm - mm + mm = m) - m)im - m)- 

Therefore, the corresponding term in the integral (|49p splits as well. We 
then finally obtain 

(50) 



g^'^ (t,s) = + - 7 TTT + 7 TTT trX 



2(l + t)2(l + s)2 ^ ' 2\{l + ty^{l + s)^ ' {l + t)^l + S 



3 t s"^ 3 t"^ s 1 t 



81 + t(l + s)3 8(l + t)3l + s 4(l + t)2(l + s) 

1 ^2 2^ ^2 ^2 

+ 4(l + t)3 (l + 5)3*^(^') + 8(l + t)3 (1 + 5)3 



4 V(l + t)3(l + 5)' (l+t)2(l + 5)^ 
+ 0(m(t,s)(tr(X3)+tr(y'5))) . 

Note that rather than improving the error term in the last step we may 
incorporate the improvement into more precise versions of (j44p and ()45p 
and then carry the improved error term along; it would result in the same 
formula (j50l) . 

Inserting ([50]) into (09]) yields 

E|||HM|.||H'.||l = ^('l + ^!^+''<^'' "■<-^'' 



2 V 2 8 16 32 



^ tr(y^) ^ tr(y2)2 tr X tr{Y^ 



+0(tr(X3)+tr(y6)) 



256 512 32 

using the elementary integrals 

oo oo oo 

t \ dt f dt ^ /" Vidt TT 



vr; 



l + t J t3/2 ' J (l + i)2^ 2' 7 (l + i) = 

u 

which is the statement of the present lemma. 
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5.1. Proof of Lemma 14.61 To prove Lemma 14. 61 we will need the following 
lemma (which establishes the asymptotics for some expressions involved in 
X and Y - see ([27|) ). whose proof is relegated to Section [8l 

Lemma 5.4. We have the following estimates: 



2. jD{x)D{xYdx = ^^. j[D{x)D{xYYdx = 2.^[l + 0[^^ 

3. Jr[xfD{x)D{xfdx = |% (l + O 



4. JiT{H{xf)dx = ^^. fr{x)Hr{Hixr)dx = 2.^(l + 0(j^ 

5. ftv{H{x)')dx = §,{n+Jlnm + 0[§ 



Jtr{H{xyydx = ^(7 + /2„(4)2) + O (llj . 

6. jD{x)D{x)Hv{H{xf)dx = H (l + O (;^)) . 

7. / r{x)D{x)Hix)DixYdx = -i • || (l + O 

8. / D{x)H{xfD{xYdx = 1 . + O 

9. / {D{x)D{xYfdx = O {ElTZein)) . 
10. / r{x)^D{x)D{xydx = O {EnTZe{n)) . 



11. Jtr{H^)dx = O (£;^7^6(n)) 

T 



Proof of Lemma |^.6] assuming Lemma 5.4 ■ In this proof we will suppress 
the dependence on x (and n), i.e. use the shortcuts r = rn{x), X = X„(x), 
Y = Yn{x), D = Dn{x), H = Hn{x). We have 

JtrXdx= J tr Xdx + 0{meas{B)) 

T T\B 

by the uniform boundedness ()30p of X. Since meas(i?) is small (|32p and on 
T\B we may write 



1 — 
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we have 

tr Xdx 



T 



~ j j DD^dx + J r'^DD^dx] +0(7^6(n)) 

Vt T / 



by parts [TOl [2] and [3] of Lemma 15.41 Arguing in a similar fashion, we obtain 
jtT{Y^)dx j [t^iH^) + 2rDHD'] dx = ^ - ^ + O (7^6(n)) , 

J tr{XY'^)dx ~ - Jg J DH^D'dx = + 0(7^5(n)), 

J tv{Y')dx ~ ^ 1 tr{H^) + O (7^6(n)) , 

T T 

J tr{YYdx = ^ j ti{H^fdx+0{nQ{n)) = ^{7+rin{^f)+0{n^{n)). 

T " T " 

This shows parts [H El El El and El parts miZl [5] and being similar. 

To see part llOl we notice that, as X is uniformly bound (jSOp . and meas(i?) 
is small (|32p . it is sufficient to bound the contribution on T\i? only, so that 
we may assume that r is bounded away from ±1: 

(51) J tr{X^)dx jiDD^dx + nein). 

T T 
Part [TU] of Lemma 14.61 then follows upon applying part El of Lemma 15.41 with 
(|51|) . The proof for part[TTlis very similar, using part[TTlof Lemma [5. 4[ and 
we omit it here. ■ 



6. Proof of Theorem 12.21 

We begin by recalling some needed results from additive combinatorics. 
An additive set is a finite and non-empty subset of an ambient (additive) 
abelian group Z. Given an additive set A, we define E{A,A), the additive 
energy of A by 

E{A,A) := \{{yi,y2,y3,y4) ^ A'^ ■yi + y2 = + 2/4}! • 

We shall use the following "large energy version" of the Balog-Szemeredi- 
Gowers theorem (see |25t Ch. 2.4-5]): 

Theorem 6.1 (BSG). Let A be an additive set, and let K >1. There exists 
an absolute constant C with the following property: if E{A,A) > \A\'^/K, 
then there exists a subset Ai Q A satisfying 

(52) \Ai\>K-^\A\ 
and 

(53) \Ai+Ai\ < K^\A\. 
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Remark 6.2. Theorem [6TT] can easily be deduced from Proposition 2.26 and 
Theorem 2.31 of [25] as fohows: by Theorem 2.31, E{A,A) > \Af/K im- 
phes that there exists subsets Ai (1 A, A2 Q A with \ Ai\ > K~'^'\A\, \ A2\ > 
K~'^'\A\ satisfying ^(^1,^2) < C'logi^ (where A2) denotes the Rusza 
distance between Ai,A2, and C is an absolute constant.) By Proposi- 
tion 2.26, d{Ai,A2) < C'logK implies that \Ai + ^i| < K^"\Ai\ for some 
C" only depending on C' . Taking C = max(C", C") the result follows. 

If G is a (torsion free) abelian group, a Generalized Arithmetic Progres- 
sion (GAP) of dimension d, is a subset P C G of the form 



(54) 



-P = + Yl^^^'' : < ifc < Jfc for A; = 1, . . . , d| 



with £,0, ■ ■ ■ ^ G. A GAP P is called proper, if |P| = 0^=1 -^k (^•^■' 
elements in the sum ^0 + Ylk=ijkCk are distinct). It is easy to see that a 
GAP has "bounded doubling", i.e., that l^d-f-AI/l A| is "small". A surprising 
converse is Freiman's celebrated structure theorem — an additive set with 
small doubling is essentially a proper GAP: 

Theorem 6.3 (|25j. Theorem 5.33). Let A be an additive set in a torsion 
free group G such that 1^4+^1 < -f^|A|. Then there exists a proper generalized 
arithmetic progression P, of rank at most K — 1, which contains A such that 

\P\ <exp (O fi^^W)) 1^1 . 



If A C C and 2; G C, let r2{z,A) denote the number of representations 
of 2; as a product of two elements from A. The following result by Chang 
shows that r2{z,A) is quite small when ^ is a GAP. 

Proposition 6.4 ([11], Proposition 3). Let P <^ C be a GAP of the form 
(fM|) where • • • 1 '^d G C. Then, for all z G C, 



(55) r2(z,P) <exp{C,- ^"^-^ 



log log J ^ 

where J = maxi<fc<;rf J^, and the constant only depends on the dimension 
dofP. 

Proof of Theorem \2.2l Assume that \Sn\ = o{J\f^) does not hold, i.e., that 
there exists some 5 > Q such that 

(56) |56(n)| > 5M^ 

for Nn arbitrarily large. Using sum-product type estimates, we will show 
that this leads to a contradiction. 

To simplify the notation, let S = S^{n), and = Mn- From this point 
on in this proof we assume that 5 is fixed; we will write F < G for some 
expressions G (resp. F > G), if there exists a constant G (which may 
depend on 5 only), such that F < G ■ G (resp. F > G ■ G). 

Define 

(57) A = ^„ := (A„ + A„)\{0}; 
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note that A then consists of elements that have two (or exactly one for 
elements of the form 2A, A S A„) representations as sums of elements of An, 
and we also note that A is symmetric around the origin. Thus 

(58) \A\ = N'^/2 + 0{N) 
and ([56]) implies 

(59) \{{yuy2)&AxA:yi + y2£A}\>N\ 

(Note that the number of solutions to Aj = with the additional 

constraint that one of Ai + A2, A3 + A4, A5 + Xq equals zero is O(iV^); this 
follows immediately on noting that Aj + Xj = z has at most four solutions if 
z/0.) 

Letting 1^ denote the characteristic function of the set ^ C Z^, we have 

(60) \{{yi,y2) e A X A : yi + y2 e A}\ = {1a*1a, ^a), 

where we understand both the inner product and the convolution as de- 
fined on L^(Z^). Together with ()59p and the Cauchy-Schwarz inequality the 
observation (|6Up yields 

iV^ < (1a * 1a, 1a) < Ua * lAh ■ 1^1'/' ~ ^||1a * lAh ■ N. 

We may hence estimate the additive energy of A as 

E{A,A) = |{(2/i,y2,2/3,2/4) e : yi +^2 = ys + yi}] 

= \\lA*tA\\l>N'>\A\\ 

We now apply Theorem 16.11 on A with K = K{S) constant, to construct 
a large subset Ai <^ A having the "bounded doubling" property ([53]) . and, 
in addition (j52p : together with (j58p the latter implies 

(61) l^i|>A^'. 

Hence, by applying Theorem 16.31 with G = 7? and Ai C G, there exists a 
proper GAP 

(62) P = jeo + ■0<jk< Jk for /c = 1, . . . 

as in (j54p . of bounded dimension (depending on 6 only), 

(63) d{6) = d{K{6)), 
so that A Q P and 

|P| <exp (O l^il < 

We then have 

l^il = |Pn^i| < \PnA\ < J2 l(^-a;)nA„|, 

x-eA„ 

where for the latter inequality we used the definition ()57p of A. Hence, by 
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and therefore (the length of summation being A^), 

|(P-x)nA„| >iv 

for some x G A^. Replacing P by P — x if necessary, we may assume that 

(64) \Pr\K\>N. 

Using Chang's Proposition 16.41 the latter leads to a contradiction as fol- 
lows. If P = {eo + Et=iMk : < jfe < Jk for A: = 1, . . . , d}, then PU P is 
contained in a GAP, of dimension 2d+ 1, of the form 

{d 2d ^ 

k=l k=d+l } 

where < jo < 2, < jk < Jk for A; = 1, . . . , d, and < j'^ < Jk-d for 
k = d+l,...,2d. 

Considering P' as a subset of Z + it is clear (since for every z £ An, 
z -z = n) that 

(65) r2{n,P')>\P'nAn\>N 

by (|64p . On the other hand, Proposition 16.41 applied on P' implies that 

log J 



r2(n,P') < exp (^C2d+i 



log log J 



where 



J = max Jk < \P\ < iV^ 



l<k<d 

with Jk as in and thus 



r2{n,P') < exp ^C2d+i: 



logiV 



log log N 

Combined with (|65p the latter estimate implies 

log N 



iV<exp C2d+i , 

\ log log A* 

or, taking logarithm of both sides, 

logiV 



log N <C- 



log log A'" 

for some C = C{6) that may depend on 5 only (by ([63]) ). This is clearly 
impossible for N arbitrarily large, and the desired contradiction concludes 
the proof. ■ 

7. Probability measures on arising from A„ 

Recalling that 5 = {n G Z : n = + 6^, a, 6 G Z}, define 

S{x) := {n £ S : n < x}, 

and for a subset S" C S* similarly define S'{x) := {n £ S' : n < x}. We 
say that a set 5' C 5 has asymptotic density s £ [0, 1] if lim = s. 

Further, we say that a subsequence (nj)j>i of elements in S is thin if the 
subset {nj}j>i C S has asymptotic density zero. 
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It is known [19], that as x — )• oo, S(x) ~ where c > is known as 

' ' ' ' \ / V log X 

the Landau-Ramanujan constant; in particular Mn grows as ~ c-y/log x on 
average for n < x. Moreover, a straightforward modification of an Erdos- 
Kac type argument to the set S shows that 

\{n G S{x) : logAA„ > loglogn}] = \S{x)\ ■ (1 + o(l)) 

as X — )• oo, and consequently there exists a density one subset S' C S such 
that Mn — )■ oo if n € 5" and n — )■ oo. 

Further, the lattice points A„ are equidistributed on along generic 
subsequences of energy levels (see e.g. \16\ Proposition 6] in the following 
sensci: there exists a density 1 subsequence S" Q S so that /i„ i/, 

n£S" 

where u is the uniform probability measure 

(As usual, the notation Vi ^ v stands for weak convergence of probability 
measures on S^, i.e., that J fdvi — )• J fdv for every continuous bounded 
test function /.) In particular, for a generic sequence of elements n £ S, 
Mn — >■ 00 and the points in are equidistributed in S^. 

In the other direction, Cilleruelo [13] has shown that there are thin se- 
quences {Eni)i>i ■^ith Mm — ^ 00, such that converges to the atomic 
probability measure supported at the 4 symmetric points ±1, iti: 

1 ^ 

(66) ^ j/Q := -^^.fe. 

fc=0 

7.1. Some number theoretic prerequisites on Gaussian integers. 

Before proceeding with the proof of Proposition 11.21 we begin with some 
number theoretic preliminaries on jjLn (see e.g. jl3]). 

To describe /U„ we recall some basic facts about Gaussian integers. Given 
a prime p = 1 mod 4, the equation + = p has exactly eight solutions 
in integers x,y, and there is a unique solution satisfying < Up < Xp. We 
can hence attach an angle 6p G [0, 7r/4] to each such p by writing Xp + ii/p = 
y/pe^^p. On the other hand, given a prime q = 3 mod 4, the equation 
x^ + = q has no solutions; whereas x^ + = 2 has exactly four solutions. 
Moreover, the following holds for the ring of Gaussian integers: the units are 
given by i'' for k € {0, 1,2,3}, the set of Gaussian primes are, up to units, 
given by 1 + i, primes q G Z"*" with q = 3 mod 4, and to each prime p = 1 
mod 4 there corresponds two Gaussian primes, namely Xp + iyp = ^Jpe^^p 
and Xp — iyp = -y^e~*^p. 

The elements of A„ can then be parametrized as follows: let 

where pi and qi are all the primes satisfying pi = 1 mod 4 and q = 3 mod 4. 
Each pair (x, y) arises as follows: with 2: = x + iy, we have 

z = x + iy = i''.{l + if^ . Yl {^"^e'^^^p-^^P^^p) ■ W q^ 



"^Proposition 6 in [TB] implies that all the exponential sums are o(l) for a density one 
sequence of energy levels. The equidistribution follows from the Weyl's criterion. 
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where k G {0, 1, 2, 3}, and < < ep for each p\n. 

We can now describe Hn as convolutions over prime powers: define 

1 ^ 

k=0 

(/ii = i/Q as in dee])), /i2<=2 := '5((i+j)/y2)«2 > and 

1 

+ ^ Zp=0 

(the "desymmetrized" version of Then 

where the convolution of two measures /i, /i' on is given by 
{fiir fi'){z) = / fi{w)id'{z/w)dw. 

7.2. Proof of Proposition 11.21 

Proof. That z^o and arise as weak limits of (/U„.)i>i was already noted 
in the introduction of Section [71 

To show that the same is true for Ua for a G (0, 7r/4) we argue as follows: 
An easy consequence of Gaussian primes being equidistributed in sectors 
(cf. [T71 [12] ) is that there exists an infinite sequence of primes pi < p2 < ■ ■ ■ 
such that 9p. — )• 0, where each pj = 1 mod 4 (also see [13].) To proceed we 
construct a sequence of integers rij such that 

(67) jink i>a; 

this will immediately imply that — t- I'a since /i^n. and Va are the sym- 
metrized versions of and Da respectively. 

Thus, let Cfc = [a/dpk]: where [a/Op^,] denotes the integer part of a/6p^, 
and define 

rik =Pk ■ 

Then, for / any continuous function on S^, 

(68) / /(0)ci/2„,(0) = ^f;/(e^(^^-2')^-O- 
The latter is the 20^^ -spaced Riemann sum for the integral 

with 

"fc = ^Pk ■ [«/^pJ = a - ^Pfe • {a/Op J, 
where {•} is the fractional part of a real number. 

Note that since Opj^ — >■ (so that the Riemann sum spacing vanishes) , 

\Opk-WOpk}\<Opk^o^ 
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and so — )■ a. Therefore, as fc — )• oo, we have 

for any continuous test function /. Thus all z/q are attainable as limiting 
measures, and the proof of the first statement is concluded. 

To see that the Fourier coefficent Va{^), for a G [0, -j], attains all values 
in [0, 1] it is sufficient to notice that 

i?o(4)=/iT(4) = i, z;;;74(4) = o, 

and clearly the function a 1— )■ z?a(4) is continuous. Therefore, by the inter- 
mediate value theorem, given any value b G [0,1], there exists a number 
a = a{h) so that z/a(4) = b. Since Va is attainable for all a, the second 
statement of Proposition 11.21 follows. ■ 

8. Proof of Lemma [531 
For a probability measure on 5^ we define 

Bi{ii):= j {zi,Z2)'^dfj.{zi)dn{z2), 

the 4th moment of cosine of the angle between two random points on 5^ 
drawn independently according to fi. For instance, if /i = /i„ are the atomic 
measures in ([7]), 

(69) BM = j^ J2 (^i'^2)' = -^ cos(e(Ai,A2))^ 

is the 4th moment of cosine of the angle ^(Ai,A2) between two random 
uniformly and independently drawn A„-points Ai,A2. While the expression 
B^^n) comes up naturally from some of the expressions evaluated in Lemma 
15.41 it is simply related to /in (4) as in the following lemma. 

Lemma 8.1. For any probability measure fi on , invariant w.r.t. x ^ ix 
and X I— 7- x, we have 

Proof. Use for zi, Z2 G 

Z1Z2 + Z1Z2 

{Z1,Z2} = 

together with the binomial formula for (^1,22)^ or, alternatively, the stan- 
dard identity 



cos(^)^ = 1 + 1 cos{4e) + I cos(20) 
00 2 



to rewrite B/i{^) as 

^4(m) = 1 + 



j (zfza^) + hi [zW)^ diJi{zi)diJi{z2) 
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The statement of the present lemma follows upon noting that j z^dfi(z) = 

S 

f z'^dfi(z) = /I(4) G R and J z'^dfi{z) = f z'^dfi{z) vanish by the symmetry 
S S S 

assumptions. ■ 



Proof of Lemma \5.4\ In order to evaluate the integrals we will use (|lip , ([22 
and (|23p . and the orthogonality relations of the exponentials 



(70) e{{\,x))dx 



|1 A = 0, 
I otherwise. 

T ^ 

Most of the computations are similar in nature, and we will only show a few 
examples in detail, omitting the rest. 

The statement of part [1] of the present lemma concerning the second 
moment of r is evident in light of (|lip and ()70p . For the 4th moment, we 
have 

(71) / r{xfdx = 4rASi{^)l 



where 



54(n) = |(Ai,...A4)G A„: 2^^^^ = o| 

is the length-4 correlation set of frequencies (cf. (jlSp ). Note that since two 
circles may have at most 2 intersections (i.e. circles of radius -y/n centered 
at and Ai + A2), (Ai, . . . , A4) G «S'4(n) implies that either of the following 
holds: 

(Ai = — A2 and A3 = — A4) or 
(72) (Ai = -A3 and A2 = -A4) or 

(Ai = — A4 and A2 = —A3). 

Conversely, every tuple of either of the forms above is lying inside 5*4(4). In 
particular, 

|54(n)| =3AA2 fl + O ^ ^ 



the error term being an artifact of the existence of degenerate tuples of the 
form 

(±A,±A,±A,±A) G 54(4) 

(with precisely two plus and two minus signs). Part [1] of the present lemma 
then follows upon substituting the latter into (j7ip . We will use the fine struc- 
ture (j72p of 5*4 (n) in the course of the proof of most of the other statements 
of the present lemma. 

Now we turn to part [2] of the present lemma. While the first statement 
is clear from ()22p and (j70p . to show the other statement, we invoke the fine 
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structure (j72]) of 84(71). We have 

(2vr)4 



/ 



{D{x)D{xyydx 



A1A2A3A: 



(Ai,...,A4)eS4(n) 



^ (Ai,A2)2+ AiA*A2Ai + 0(AA7i2) 

Ai,A2 6A„ Ai,A2eA„ 



The result of the present computation then follows upon making the simple 
observations 



AeA„ 



for every C G (see [22], Lemma 5.2) and 



E A*A = l^nn ■ h 



A6A„ 



To show part [3] we note 



r(xYDix)DixYdx 



(Ai,...,A4)eS'4(n) 



and only tuples with A3 = — A4 contribute to the latter summation, i.e. 
those of the first type in (|72p . The computation for part H] is very similar 
to what we encountered before, using ()23p with ()70p for the first statement, 
and exploiting the fine structure ([72]) of ^4 (n) for the second one 

iT{H{xf)dx = E 4A3AiA4. 

J (Ai,...,A4)e54(n.) 

To compute the integrals in part [5] we exploit Lemma 18.11 Similarly to 
the previous computations, we have by ([23]) and ([70]) 

I tT{H{xf)dx = E tr(AiAiA* A2A* A3A* A4) 



(47r 



2^4 



(Ai,...,A4)eS4(n) 

E tr (A* AiA*AiA* A2A*A2) + E tr (A* AiA*2A2A* AiA*2A2) 

Ai,A2eA„ Ai,A2eA„ 

En 



E tr (A*iAiA*2A2A*2A2A*iA 

Ai,A2eA„ 



+ 



r? E (^i'^2>'+ E (Ai,A2>^ + n2 (Ai,A2> 

Ai,A2GA„ Ai,A26A„ Ai,A2eA„ 

Ml. 



^(l + i?4(n)) + 



+ 



where we used sums as above and the definition ([69p of B^iji). Using Lemma 
18.11 we may then rewrite the latter expression in terms of /In (4), as in the 
statement of the present lemma. 
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A similar computation shows that the second integral in part [5] is given 

by 

I tr{H{xffdx = ^ (1 + 2i?4(n)) + O 

and using Lemma [8.11 again yields the result given. Evaluating the integrals 
for parts [6ll8] of the present lemma is straightforward and very similar to the 
above computations, and we omit it here. 

We now prove part [9] of the present lemma. We have by symmetry 

(73) j {D{x)D{xYfdx = j \\D{x)\fdx<, J dx, 



T T T 

whence 

= Kf^ Z2 ^i-----^l'^irre-\S6{n)\=E^ r{xfdx, 

" Ai,...,A6eA 

by ()17p . where we used the uniform bound 

\>^}\ <Vn<.VE. 

The present statement of Lemma 15.41 then follows upon substituting (|74p 
into ()73p . The proofs for the last parts [10] and [11] of the present lemma are 
very similar and we omit them here. I 
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